Group 18: FINDING GENE PATTERNS IN BREAST CANCER DATA
Introduction:
Question: Which genes are differentially expressed between breast cancer subtypes?
General workflow
![]()
General workflow
EXPLORATORY ANALYSIS AND TIDY:
![]()
Cleaning procedure
DESEQ Analysis:
![]()
DESEQ workflow
PCA Analysis:
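The scree and cumulative explained-variance plots show how much of the variance each principal component captures.
Many components are needed to explain 85% of the variability in the data. This high dimensionality suggests that the expression differences between samples cannot be reduced to a few axes, which makes the analysis difficult.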
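As a minimal sketch of what the scree and cumulative variance plots summarize, the snippet below computes the explained-variance ratio per PC and counts how many leading PCs are needed to reach a threshold such as 85%. The eigenvalues and the function names `cumulative_variance` and `components_needed` are hypothetical and chosen for illustration; they are not taken from the project data or code.

```python
from itertools import accumulate


def cumulative_variance(eigenvalues):
    """Return (explained_ratio, cumulative_ratio) for PCA eigenvalues."""
    total = sum(eigenvalues)
    ratios = [value / total for value in eigenvalues]
    return ratios, list(accumulate(ratios))


def components_needed(eigenvalues, threshold=0.85):
    """Smallest number of leading PCs whose cumulative variance reaches threshold."""
    _, cumulative = cumulative_variance(eigenvalues)
    for count, covered in enumerate(cumulative, start=1):
        if covered >= threshold:
            return count
    return len(cumulative)


# Hypothetical eigenvalues (variance per PC), sorted in decreasing order.
toy_eigenvalues = [4.0, 2.0, 1.5, 1.2, 0.7, 0.4, 0.2]

ratios, cumulative = cumulative_variance(toy_eigenvalues)
n_components = components_needed(toy_eigenvalues, threshold=0.85)

print("Explained variance ratio:", [round(r, 2) for r in ratios])
print("Cumulative variance:", [round(c, 2) for c in cumulative])
print("PCs needed for 85%:", n_components)
```

On real expression data the eigenvalues would come from the PCA fit itself, and the count returned by `components_needed` is the quantity the cumulative variance plot makes visible.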
- The genes highlighted for each PC are the main drivers of variance in the data, so they might be linked to specific biological pathways or processes.
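One common way to pick the genes that drive each PC is to rank genes by the absolute value of their loading on that component. The sketch below assumes this approach; the loadings, the placeholder gene names, and the helper `top_genes_per_pc` are hypothetical and only illustrate the idea.

```python
def top_genes_per_pc(loadings, gene_names, k=3):
    """For each PC, return the k genes with the largest absolute loading."""
    top = {}
    for pc_index, pc_loadings in enumerate(loadings, start=1):
        ranked = sorted(
            zip(gene_names, pc_loadings),
            key=lambda pair: abs(pair[1]),  # sign is ignored, only magnitude matters
            reverse=True,
        )
        top[f"PC{pc_index}"] = [gene for gene, _ in ranked[:k]]
    return top


# Hypothetical loadings: one row per PC, one column per gene.
gene_names = ["GENE_A", "GENE_B", "GENE_C", "GENE_D"]
loadings = [
    [0.70, -0.10, 0.05, -0.69],
    [0.10, 0.80, -0.55, 0.20],
]

top = top_genes_per_pc(loadings, gene_names, k=2)
print(top)
```

Ranking by absolute value keeps genes with strong negative loadings, since those contribute to the component as much as strongly positive ones.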
Discussion: Biological insights
- The DE genes in the data most strongly affect X pathways.
- This is / is not consistent with the literature (1, 2, 3).